15 research outputs found

    3DLigandSite: Structure-based prediction of protein-ligand binding sites

    Get PDF
    3DLigandSite is a web tool for the prediction of ligand-binding sites in proteins. Here, we report a significant update since the first release of 3DLigandSite in 2010. The overall methodology remains the same, with candidate binding sites in proteins inferred using known binding sites in related protein structures as templates. However, the initial structural modelling step now uses the newly available structures from the AlphaFold database or alternatively Phyre2 when AlphaFold structures are not available. Further, a sequence-based search using HHSearch has been introduced to identify template structures with bound ligands that are used to infer the ligand-binding residues in the query protein. Finally, we introduced a machine learning element as the final prediction step, which improves the accuracy of predictions and provides a confidence score for each residue predicted to be part of a binding site. Validation of 3DLigandSite on a set of 6416 binding sites obtained 92% recall at 75% precision for non-metal binding sites and 52% recall at 75% precision for metal binding sites. 3DLigandSite is available at https://www.wass-michaelislab.org/3dligandsite. Users submit either a protein sequence or structure. Results are displayed in multiple formats including an interactive Mol* molecular visualization of the protein and the predicted binding sites

    Sequence analysis of 43-year old samples of Plantago lanceolata show that Plantain virus X is synonymous with Actinidia virus X and is widely distributed

    Get PDF
    Plantain virus X was first recognized by the ICTV as a species in the genus Potexvirus in 1982. However, because no sequence was available for plantain virus X (PlVX), abolishing the species was proposed to the Flexiviridae working group of the ICTV in 2015. This initiated efforts to sequence the original isolates from Plantago lanceolata samples. Here we report the full‐genome sequencing of two original isolates of PlVX, which demonstrate that the virus is synonymous to Actinidia virus X, a species previously reported from kiwifruit (Actinidia sp.) and blackcurrant (Ribes nigrum). PlVX was previously noted to be widespread in the UK in P. lanceolata. This report additionally presents novel data on the distribution and diversity of PlVX, collected at the same site as the original UK isolates, and from three independent surveys, two in the Netherlands and one in Belgium. This study also includes two new host records for PlVX, Browallia americana and Capsicum annuum (sweet pepper), indicating the virus is more widespread and infects a broader range of hosts than previously reported. This stresses the importance of surveys of noncultivated species to gain insight into viral distribution and host range. This study also demonstrates the value of generating sequence data for isolates retained in virus collections. Additionally, it demonstrates the potential value in prepublication data sharing for giving context to virus detections such as the four independent studies here which, when combined, give greater clarity to the identity, diversity, distribution, and host range of plantain virus X

    PDBe-KB: collaboratively defining the biological context of structural data

    Get PDF
    The Protein Data Bank in Europe - Knowledge Base (PDBe-KB, https://pdbe-kb.org) is an open collaboration between world-leading specialist data resources contributing functional and biophysical annotations derived from or relevant to the Protein Data Bank (PDB). The goal of PDBe-KB is to place macromolecular structure data in their biological context by developing standardised data exchange formats and integrating functional annotations from the contributing partner resources into a knowledge graph that can provide valuable biological insights. Since we described PDBe-KB in 2019, there have been significant improvements in the variety of available annotation data sets and user functionality. Here, we provide an overview of the consortium, highlighting the addition of annotations such as predicted covalent binders, phosphorylation sites, effects of mutations on the protein structure and energetic local frustration. In addition, we describe a library of reusable web-based visualisation components and introduce new features such as a bulk download data service and a novel superposition service that generates clusters of superposed protein chains weekly for the whole PDB archive

    PDBe-KB: a community-driven resource for structural and functional annotations.

    Get PDF
    The Protein Data Bank in Europe-Knowledge Base (PDBe-KB, https://pdbe-kb.org) is a community-driven, collaborative resource for literature-derived, manually curated and computationally predicted structural and functional annotations of macromolecular structure data, contained in the Protein Data Bank (PDB). The goal of PDBe-KB is two-fold: (i) to increase the visibility and reduce the fragmentation of annotations contributed by specialist data resources, and to make these data more findable, accessible, interoperable and reusable (FAIR) and (ii) to place macromolecular structure data in their biological context, thus facilitating their use by the broader scientific community in fundamental and applied research. Here, we describe the guidelines of this collaborative effort, the current status of contributed data, and the PDBe-KB infrastructure, which includes the data exchange format, the deposition system for added value annotations, the distributable database containing the assembled data, and programmatic access endpoints. We also describe a series of novel web-pages-the PDBe-KB aggregated views of structure data-which combine information on macromolecular structures from many PDB entries. We have recently released the first set of pages in this series, which provide an overview of available structural and functional information for a protein of interest, referenced by a UniProtKB accession

    PDBe-KB: collaboratively defining the biological context of structural data

    Get PDF
    The Protein Data Bank in Europe – Knowledge Base (PDBe-KB, https://pdbe-kb.org) is an open collaboration between world-leading specialist data resources contributing functional and biophysical annotations derived from or relevant to the Protein Data Bank (PDB). The goal of PDBe-KB is to place macromolecular structure data in their biological context by developing standardised data exchange formats and integrating functional annotations from the contributing partner resources into a knowledge graph that can provide valuable biological insights. Since we described PDBe-KB in 2019, there have been significant improvements in the variety of available annotation data sets and user functionality. Here, we provide an overview of the consortium, highlighting the addition of annotations such as predicted covalent binders, phosphorylation sites, effects of mutations on the protein structure and energetic local frustration. In addition, we describe a library of reusable web-based visualisation components and introduce new features such as a bulk download data service and a novel superposition service that generates clusters of superposed protein chains weekly for the whole PDB archive

    Nerine potexvirus 1: a new Potexvirus species detected from Nerine in the United Kingdom

    No full text
    We identified full-length sequence of a potexvirus infecting Nerine plants in the UK. By sequencing viruses in historical collections we have been able to demonstrate that the identified virus, for which we propose the name nerine potexvirus 1, is a potentially novel species in the genus Potexvirus. Analysis of the potexviruses in the historic collections has also enabled us to generate the first full-length sequence of nerine virus X (NVX) to be isolated from Nerine; and to clarify the taxonomy of NVX isolates infecting Nerine and Agapanthus. Analysis of isolates from the historical collections has enabled us to link biological data gathered in the pre-genomic era to specific isolate sequences

    Genomic High Plains Wheat Mosaic Virus Sequences from Australia: Their Phylogenetics and Evidence for Emaravirus Recombination and Reassortment

    No full text
    High Plains wheat mosaic virus (HPWMoV) causes a serious disease in major wheat-growing regions worldwide. We report here the complete or partial genomic sequences of five HPWMoV isolates from Australian wheat samples. Phylogenetic analysis of the nucleotide sequences of the eight genomic segments of these five isolates together with others from Genbank found all eight genes formed two lineages, L1 and L2. L1 contained a single isolate from Colorado in the North American Great Plains Region (GPR), and L2 had two unresolved clusters, A and B, of isolates from Australia and the GPR. A quarter of the L2B isolate sequences of the nucleocapsid gene (RNA3) were recombinant, which is unexpected as little evidence of recombination exists in viruses with negative single-stranded RNA genomes. Phylogenies calculated from the amino acid sequences of HPWMoV’s RNA-dependent RNA-polymerase (RNA1), glycoprotein (RNA2), and nucleocapsid protein (RNA3) showed they were closest to those of Palo Verde broom virus. However, its movement protein (RNA4) was closer to those of Ti ringspot-associated and common oak ringspot-associated viruses, indicating the RNA4 segments of their ancestors reassorted to produce the current emaraviruses. To avoid increased yield losses from co-infection, biosecurity measures are advised to avoid HPWMoV introduction to countries where wheat streak mosaic virus already occurs

    Genomic High Plains Wheat Mosaic Virus Sequences from Australia: Their Phylogenetics and Evidence for Emaravirus Recombination and Reassortment

    No full text
    High Plains wheat mosaic virus (HPWMoV) causes a serious disease in major wheat-growing regions worldwide. We report here the complete or partial genomic sequences of five HPWMoV isolates from Australian wheat samples. Phylogenetic analysis of the nucleotide sequences of the eight genomic segments of these five isolates together with others from Genbank found all eight genes formed two lineages, L1 and L2. L1 contained a single isolate from Colorado in the North American Great Plains Region (GPR), and L2 had two unresolved clusters, A and B, of isolates from Australia and the GPR. A quarter of the L2B isolate sequences of the nucleocapsid gene (RNA3) were recombinant, which is unexpected as little evidence of recombination exists in viruses with negative single-stranded RNA genomes. Phylogenies calculated from the amino acid sequences of HPWMoV’s RNA-dependent RNA-polymerase (RNA1), glycoprotein (RNA2), and nucleocapsid protein (RNA3) showed they were closest to those of Palo Verde broom virus. However, its movement protein (RNA4) was closer to those of Ti ringspot-associated and common oak ringspot-associated viruses, indicating the RNA4 segments of their ancestors reassorted to produce the current emaraviruses. To avoid increased yield losses from co-infection, biosecurity measures are advised to avoid HPWMoV introduction to countries where wheat streak mosaic virus already occurs
    corecore